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ABSTRACT 

We study the effect of imperfect subtraction of the Sunyaev-Zel'dovich effect (SZE) 
using a robust and non-parametric method to estimate the SZE residual in the Planck 
channels. We include relativistic corrections to the SZE, and present a simple fit- 
ting formula for the SZE temperature dependence for the Planck channels. We show 
how the relativistic corrections constitute a serious problem for the estimation of the 
kinematic SZE component from Planck data, since the key channel to estimate the 
kinematic component of the SZE, at 217 GHz, will be contaminated by a non-negligible 
thermal SZE component. The imperfect subtraction of the SZE will have an effect on 
both the Planck cluster catalogue and the recovered CMB map. In the cluster cata- 
logue, the relativistic corrections are not a major worry for the estimation of the total 
cluster flux of the thermal SZE component, however, they must be included in the 
SZE simulation when calculating the selection function and completeness level. The 
power spectrum of the residual at 353 GHz, where the intensity of the thermal SZE is 
maximum, does not contribute significantly to the power spectrum of the CMB. We 
calculate the non-Gaussian signal due to the SZE residual in the 353 GHz CMB map 
using a simple Gaussianity estimator, and this estimator detects a 4.25tT non-Gaussian 
signal at small scales, which could be mistaken for a primordial non-Gaussian signa- 
ture. The other channels do not show any significant departure from Gaussianity with 
our estimator. 



1 INTRODUCTION 



The quality of the data from the Planck satellite will allow 
us to make precision cosmology, however, accurate param- 
eter extraction will require precise modeling of the data. 
The scientific possibilities with the new data will depend 
strongly on the ability to perform the component separa- 
tion. In each of the Planck channels the data will be a 
mixture of galactic components (synchrotron, free-free and 
dust), extra-galactic components (unresolved galaxies and 
galaxy clusters), and instrumental noise. Recently several 
algorithms have been proposed to perform such component 
separation. Most of these methods rely on a priori knowl- 
edge of the frequency dependence of each individual compo- 
nent and their power spectrum (maximum entropy, (Hobson 
et al. 1998); multi-frequency Wiener filter, (Tegmark & Ef- 
stathiou, 1996; Bouchet & Gispert 1999)). The advantage 
of these methods is that they can recover all the difi'erent 
components simultaneously, however, the drawback is that 
if some of the assumptions about the frequency dependence 
and/or power spectrum is wrong, then the final result will 
be biased. Other methods are designed to recover just one 
of the components, and the most popular ones focus on the 
recovery of compact sources. Since the best resolution of 
Planck is 5 arcmin, the extra-galactic galaxies will appear 



as unresolved point sources with a shape matching the point 
spread function of the instrument. This fact can be used to 
define optimal filters which will increase the signal to noise 
ratio of the bright point sources, thus allowing the detection 
and removal of most of them (Tegmark & de Oliveira-Costa, 
1998; Sanz et al. 2001). A similar technique can be applied 
to the detection of clusters if one assumes a circular shape. 
Since the typical diameter of a cluster is a few arcmin, most 
of the clusters will appear similar to the unresolved point 
sources, and only some will be resolved by the instrument. 
The definition of the optimal filter is a bit more complicated 
in this case since the optimal scale of the filter will be differ- 
ent for each cluster, however, this problem can be partially 
solved by filtering the maps with different scales (Herranz 
et al. 2001). 

An alternative non-parametric approach to detect clus- 
ters in Planck data was recently proposed (Diego et al. 
2002). In that method no assumptions are made about the 
specific frequency dependence of the different components 
(except for the SZE component), and also no assumption 
about the power spectrum of the components or scale (or 
symmetry) of the clusters. The only assumption is that the 
frequency dependence of the SZE (in the non-relativistic ap- 
proximation) is known. Even without many of the typical 
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assumptions, the recovered SZE component is a good es- 
timate of the total contribution of galaxy clusters to the 
different Planck channels, however, the SZE component re- 
covered by this method (as well as by other methods) does 
not match the real SZE component perfectly in the simu- 
lations. Therefore, a residual will be left in the final CMB 
map due to the non-perfect subtraction of the SZE from the 
original data. So far there have been no attempts to study 
how this SZE residual could affect the conclusions derived 
from the residual contaminated CMB map. One of the rea- 
sons is that the SZE residual depends on the method used 
to make the component separation, and different methods 
recover the SZE component with different quality factors 
and residuals. Consequently, the residual SZE map (defined 
as the "true" minus the "recovered") is different for each 
method. In this work we will study the SZE residual from 
the non-parametric method proposed in (Diego et al. 2002). 
Since this method makes a minimum number of assump- 
tions, the results are robust and less affected by systematic 
errors which could be introduced through wrong assump- 
tions. The conclusions of this work will therefore set an up- 
per limit on the contributions of the SZE residual to the 
CMB. Any other method which makes more assumptions 
than our non-parametric method, should leave a smaller 
SZE residual (provided the assumptions are correct), and 
as a consequence the effect of the SZE residual for any other 
(suitable) method should be below the limits we present 
here. It is worth pointing out that since the non-parametric 
method adopted here is optimized for the detection of the 
SZE signal, then alternative methods making many more 
assumptions do not necessarily obtain a better reconstruc- 
tion of the SZE signal. Therefore, although our approach 
provides an upper limit on the effects of the SZE residual, 
then that limit can be taken as a realistic estimate of the 
final contribution of the SZE residual to the CMB map. 

In this work we will consider an additional source of 
systematic error which has not been considered previously, 
namely the relativistic corrections to the SZE. Although 
these relativistic corrections are small for normal clusters 
(T « few keV), they can be important for massive clus- 
ters (T ~ 10 kcV). The cluster selection function of Planck 
(minimum mass detected as a function of redshift) rises 
very quickly from redshift to redshift « 0.2, and above 
that redshift it is almost flat (see Fig. 3 below). This means 
that above redshift « 0.2 Planck will only see massive clus- 
ters for which the relativistic corrections are important. 
In all component separation methods (including the non- 
parametric method considered in this work), the frequency 
dependence of the SZE component is assumed to follow the 
non- relativistic form, eq. (4) . The validity of this approach is 
unknown, and it is therefore worth while exploring the effect 
of the relativistic corrections in the context of the compo- 
nent separation process. 

The main effect of the relativistic corrections can be de- 
scribed effectively as a dilution of the frequency dependence 
of the SZE. At low frequencies {v < 217 GHz), the relativis- 
tic correction lowers the absolute value of the SZE intensity 
change. The same thing happens at higher frequencies up to 
f ~ 400 GHz. Above this frequency, the intensity change is 
larger than the one given by the non-relativistic approach 
(see Fig. 1), however, the dust contamination is very impor- 
tant at such large frequencies, and hence the effect of the 
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Figure 1. Solid line is proportional to the non-relativistic inten- 
sity change which is assumed in all existing component separation 
algorithms. The dashed (dotted) line shows the frequency depen- 
dence of the SZE when the relativistic corrections are included 
for a cluster with T = 10 keV (T = 20 keV). 

relativistic corrections becomes negligible for v > 400 GHz. 

The non-relativistic form is systematically assumed in 
all component separation algorithms, however, the dilution 
effect due to the inclusion of the relativistic correction makes 
the form of the SZE for the hottest clusters different from 
the non-relativistic form. This will lead to an additional er- 
ror in the residual of the SZE signal. Our non-parametric 
method makes the (wrong) assumption that the relativistic 
corrections are negligible for all the clusters in our simula- 
tion. However, we will test the method with SZE simulations 
where the relativistic corrections are incorporated. By do- 
ing that, we are simulating the realistic case in which the 
temperature of the cluster is not known, and therefore the 
non-relativistic form must be assumed in the algorithm to 
recover the SZE component. This wrong assumption in our 
method (and in all other methods) will add an additional 
error in the recovered SZE map. 

The cluster abundance is a sensitive probe of cosmologi- 
cal parameters such as the matter density, fi^ , and when the 
observations reach low statistical error it becomes important 
to control the systematic errors in the parameter extraction. 
The relativistic corrections to the SZE could add an addi- 
tional error in the estimate of the cluster number counts, 
and we will below consider the importance hereof. 

To study the effect of the SZE residual on the CMB we 
will focus on two aspects of the CMB, namely its power spec- 
trum and its Gaussian nature. The power spectrum depends 
strongly on the cosmological parameters, and a systematic 
error in the estimation of the power spectrum due to non- 
subtracted residuals could have important consequences for 
the best fitting cosmological model. Gaussianity is a natu- 
ral prediction of single field infiationary models, and a non- 
Gaussian signature in the CMB could have important con- 
sequences for such inflationary models. The SZE signal is 
very non-Gaussian and so is the non-subtracted residual. In 
previous works (see (Aghanim & Forni 1999; Cooray 2001; 
Rephaeli 2001; Yoshida et al. 2001) and references therein), 
the non-Gaussian signature of the SZE has been studied, but 
these works focus on the entire contribution of this compo- 
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nent. Since a large part of the SZE signal will be removed 
in the component separation process, the non-Gaussian sig- 
nature, if any, will be smaller than predicted in previous 
works, however, it could still be significant. It is important 
to understand how this residual could leave a non-Gaussian 
imprint in the CMB map, such that an erroneous interpreta- 
tion as of primordial nature of a non-Gaussian signature can 
be avoided. In this work we will study the implications on 
Gaussianity studies of the SZE residual left after component 
separation. 

The structure of the paper is the following. In section 2 
we will give a brief description of the SZE and the relativis- 
tic corrections. We also present a simple fitting formula to 
the temperature dependence of the SZE in the central fre- 
quencies of the Planck channels. This fitting formula could 
be used in future works to include the relativistic corrections 
in the simulations. In section 3 we apply the non-parametric 
method to realistic Planck simulations and we recover the 
SZE component in two cases. In the first case we simulate a 
population of clusters all with the same temperature. This 
allows us to assume the real frequency dependence of the 
SZE in the non-parametric component separation method 
and sec what the difference is with the case when the non- 
relativistic form is assumed in the component separation. In 
the second test, we make realistic simulations of the clus- 
ters with the population of clusters having different tem- 
peratures. We include the relativistic correction in our sim- 
ulation. Then we calculate the SZE residual assuming the 
non-relativistic approach for the frequency dependence of 
the SZE. In section 4 we discuss the effects of the imperfect 
SZE recovery on the Planck cluster catalogue focusing our 
attention on the systematic errors introduced in the recov- 
ered SZE map by the relativistic corrections and the prob- 
lems this causes for the kinematic SZ component. In section 
5 we discuss the effect of the SZE residual on the CMB 
map with emphasis on non-Gaussian signatures. Finally we 
present our conclusions in section 6. 



2 THE SUNYAEV-ZEL'DOVICH EFFECT 
WITH RELATIVISTIC CORRECTIONS 

In this section we will give a brief description of the SZE and 
the relativistic corrections (see e.g. the reviews (Birkinshaw 
1999; Carlstrom et al. 2001) for more details). 

As the photons of the cosmic background radiation 
(CMB) traverses a cluster of galaxies they may scatter on 
the free electrons in the ionized gas and produce the thermal 
Sunyaev-Zel'dovich effect (Sunyaev & Zeldovich 1972). The 
resulting intensity change of the CMB is proportional to the 
Comptonization parameter. 



J 'me 



riefTTh 



(1) 



where Te is the temperature of the electron gas in the cluster, 
me the electron mass, Ue the electron number density, crxh 
the Thomson scattering cross section, and the integral is 
calculated along the line of sight through the cluster. Wo 
use units where ks = h = c = 1. For an intra-cluster gas 
which can be assumed isothermal, one has yc = rTe/me, 
where r is the optical depth. 
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Table 1 — Fit parameters a^, fS^ and 7^ for the expected Planck 
central frequencies. For each frequency the coefficients come from 
AIt^(i')/{Io He) = ct^+/3^T + "f^T^ where T is measured in keV. 



The intensity change is given by 
A/t = Ioycf{x,T), 
with 

f{x,T) = f{x) + Sf{x,Te) 
and 



f{x) = f{x,Q) 



{e^ - 1)2 



x(e" + 1) 
- 1 



(2) 



(3) 



(4) 



where x = v/Tcms is the dimensionless frequency (Tcmb = 
2.725 K), and Jo = Tq^i-q / {21^"^) . The intensity change is in- 
dependent of the temperature for non-relativistic electrons, 
5f{x,Te) — 0, a limit which is valid for small frequencies 
{v ^ 100 GHz), but for high frequencies it must be cor- 
rected (Wright 1979; Rephaeli 1995) with 6f{x,Te) using 
cither an expansion in Te/nie (Stebbins 1997; Challinor & 
Lasenby 1998; Itoh, Kohyama & Nozawa 1998; Itoh et al. 
2001) or calculated exactly (Dolgov ct al. 2001). One can 
use these relativistic corrections to find the temperature of 
distant clusters (Hansen, Pastor & Semikoz 2002), and pos- 
sibly even to find the temperature of clusters in the Planck 
catalogue (Pointecouteau, Giard & Barret 1998). 

Since the exact calculation of the relativistic correction 
is time consuming, it is more convenient to use a fit to this 
correction as a function of the temperature. We have calcu- 
lated such a fit in the range T € [0.5, 20] keV and we found 
that a fit of the form. 



/(x,T) = a„+ ,a.T + 7, 



(5) 



is rather accurate (where T is given in kcV). Naturally one 
should have — f{x,0), however, since the fit is optimized 
for T = 1 - 20 keV, the T w fimit is slightly different. 
We use the exact calculations of Dolgov et al. (2001) and 
fit for each of the central Planck frequencies. The resulting 
parameters a, /3 and 7 are presented in table 1 *. 

It is important to keep in mind, that these fit param- 
eters have been calculated while neglecting bandwidth. In 
a real observational situation the proper inclusion of band- 
width is crucial, e.g. a Gaussian frequency bandwidth of 
35 GHz will produce 5% less intensity change for the 353 
GHz channel. However, this intensity change is about 5% 
both, when considering only the non-relativistic form and 

* This table (and practical details) can be found on 
http : //www-astro . physics . ox . ac. uk/"hansen/sz/ 
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when considering the full relativistic treatment, so the re- 
sults presented in this paper would basically be identical 
when including a bandwidth. 



3 THE RECOVERED SZE MAP AND ITS 
RESIDUAL 

Since our present knowledge of the galactic and extra- 
galactic components is limited, the proposed algorithms for 
component separation will leave a galactic and extra-galactic 
residual in the CMB map, and as a consequence the power 
spectrum of the CMB will be distorted. This distortion will 
be particularly important at small scales (0 fs 5 arcmin, 
/ sa 2000). The physics extracted from this part of the power 
spectrum will therefore be limited by the accuracy achieved 
in the component separation or cleaning process which will 
add a systematic error in the CMB power spectrum. 

In this work we want to investigate one of these sources 
of systematic errors, namely the imperfect recovery of the 
SZE including relativistic corrections. Usually it has been 
assumed in component separation algorithms that the SZE 
frequency dependence is described by the non-relativistic 
form, eq. (4). There are several reasons to make this as- 
sumption. First, the relativistic correction is small for many 
clusters. Second, the temperature of the clusters is unknown 
for almost all the clusters and consequently it is not possible 
to calculate the relativistic correction for the individual clus- 
ter. Finally, the different component separation algorithms 
are much simpler if the same frequency dependence is as- 
sumed for all the clusters in the map. However, the non- 
relativistic assumption will only be valid for those clusters 
where the temperature is small (typically a few keV). For 
massive (hot) clusters the relativistic correction can be im- 
portant. For instance, for a cluster with T « 10 keV , this 
correction is about 15 % at frequencies near 353 GHz (sec 
table 1), and consequently there will be an additional error 
in the estimate of the signal of the hot clusters. This error 
will leave an imprint in the CMB when the incorrectly es- 
timated signal of these hot clusters is removed. Since most 
of the clusters will be unresolved, this error will contribute 
only to the small scales of the CMB. Whether or not this er- 
ror can distort the power spectrum of the CMB significantly 
at small scales is still an open question. 

The effects of the non-relativistic assumption could also 
be important for the studies based on the cluster number 
counts. Since the non-relativistic assumption will introduce 
some error in the estimate of the flux of hot clusters, the 
number counts as a function of flux will be affected by this 
error. It is important to quantify this error when such data 
sets are going to be used to extract cosmological parameters. 

It is therefore important to investigate with simulations 
whether or not the effects of the non-relativistic assumption 
are important in the component separation process and if 
they could represent a serious problems for the estimate of 
the cosmological model which best describes the CMB data. 
The simulations of the Planck data used in this work are 
the same as those described in (Diego et al. 2002) and the 
reader is referred to that paper for more details. Our simu- 
lations include the following components; galactic emission 
(synchrotron, free- free, and dust), point sources, SZE, CMB 
and instrumental noise. We also take into account the differ- 



ent resolutions in each channel. The simulations correspond 
to an area of the sky of 12.8° x 12.8°, and each map contains 
512^ pixels. The cosmological model assumed for the CMB 
and the SZE simulation is consistent with recent CMB and 
cluster abundance observations {flm = 0.3, A = 0.7, n = 1, 
r = 0.2, 0-8 = 0.8). 

3.1 Component separation. Non-parametric 
method 

We will here give a brief summary of how to subtract the 
SZE component from the simulated Planck maps. A detailed 
description of the method can be found in (Diego et al. 
2002). 

The method is basically a non-parametric Bayesian ap- 
proach. It is non-parametric because it does not make any 
of the typical assumptions about the components, such as 
knowledge of the power spectrum and frequency depen- 
dence. The only assumption made in this method is that we 
know the frequency dependence of the SZE. It also makes the 
(wrong) assumption that the SZE signal is a Gaussian vari- 
able. However, as described in (Diego et al. 2002) , when this 
assumption is made over the Fourier modes of the Compton 
parameter map, these modes follow a probability distribu- 
tion function which is much closer to a Gaussian than in real 
space. The method performs a basic cleaning of the maps 
where a first estimate of the point source, dust and CMB 
contributions is removed. Then the Bayes theorem is ap- 
plied on the clean maps and an estimate of the SZE map is 
obtained. 



where i/c's are the Fourier coefficients of the Compton pa- 
rameter map. d is the vector containing the Fourier coeffi- 
cients of the clean maps, C~^ is the inverse of the correlation 
matrix of the clean maps, R is another vector (same dimen- 
sion as d) containing the Fourier transform of the beam 
response of the instrument and the frequency dependence 
of the SZE, and Py^ is an estimate of the power spectrum 
of the Compton parameter map. This estimate can be ob- 
tained from the data after a first run of the code. Finally, 
the Compton parameter map is obtained by Fourier trans- 
forming the Fourier coefficients obtained in equation (6). 

3.2 The case of known temperature 

In the Bayesian approach, one often assumes knowing the 
frequency dependence of the SZE, and that this frequency 
dependence follows the non-relativistic form, i.e 5f{x, Te) = 
in eq. (3). Since the frequency dependence of the relativis- 
tic correction depends on the unknown temperature of the 
cluster this cannot be computed when the temperature of 
the cluster is unknown. This assumption is common to all 
the existing component separation algorithms. To test how 
important is the error introduced in the recovered SZE map 
due to the previous assumption, we have simulated a pop- 
ulation of hot clusters at different redshifts (up to z = 1) 
and with different masses, but with the constraint that all of 
them must have the same temperature, T = 10 keV. In our 
simple case, where we know the temperature T = WKeV, 
we can use the real (including the relativistic corrections) 
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Figure 2. Recovered SZE map (bottom left) compared with the input SZE map used in the simulation (top left) in the 353 GHz channel. 
The right large panel shows the difference between both maps (the residual). The units are AT/T at 353 GHz. The brightest point in the 
residual is about 50 % of the original value. These bright points are located in the hottest clusters. This residual will mainly contaminate 
the CMB component. 



frequency dependence of the SZE in the component separa- 
tion algorithm since the SZE map is composed of clusters all 
with the same T and consequently, all with the same f(x, T). 
Then, we apply our Bayesian estimator to our Planck sim- 
ulations which includes galactic components, CMB, point 
sources, SZE and instrumental noise, and we recover the 
SZE component in two situations. In the first case wc as- 
sume the real f{x,T) (including the relativistic correction) 
and in the second case we assume that fix) corresponds to 
the non-relativistic case. Then we look at the difference of 
the two recovered maps. We find that the relative difference 
in the two Compton parameter maps is between ~ 4% (low 
redshift clusters, z < 0.5) and ~ 8% (high rcdshift clusters, 
2 > 0.5). This difference is smaller than the 15% difference 
between the relativistic and non-relativistic approaches at 
353 GHz (T = 10 keV). The reason for that is that the 
recovered SZE map is obtained by averaging over different 
channels where the differences between the relativistic and 
non-relativistic forms are smaller than 10% for the channels 
below 353 GHz (the two channels at 545 and 857 GHz do 
not contribute significantly to the previous average) . We can 
conclude that the effect of the non-relativistic assumption 
in the component separation algorithm is not a major worry 
(but see section 4 below). Therefore, in the case of real life 
where the population of clusters have different (unknown) 
temperatures, the non-relativistic form can be assumed in 
the component separation since it will only introduce a small 
error. As we will see in the next section, the main error in 
the SZE estimation will come from the imperfect separation 
between the SZE and the other components. 

3.3 The real case. Unknown temperatures. 

In the previous section we have seen that for a population 
of hot clusters (T = 10 keV) for which the relativistic cor- 
rections can be important, the effect of assuming the non- 
relativistic form in the component separation algorithm is 
not larger than 8%. However, even in the case when we as- 
sume the right frequency dependence, the component sep- 
aration algorithm will have an intrinsic error and the SZE 
will not be subtracted perfectly. This case was considered 
in (Diego ct al. 2002) but in that work the authors did not 
include the relativistic corrections neither in the simulation 
of the SZE component nor in the assumed frequency depen- 
dence of the SZE in the non-parametric method. 

When testing component separation algorithms with 
simulations, the frequency dependence of the SZE appears 
not only in the assumption made in the component separar- 
tion process but also in the simulation of the SZE compo- 
nent. Usually the simulations of the SZE do not include the 
relativistic correction, and as explained in the introduction, 
the dilution effect implies that if the simulations do not in- 
clude the relativistic correction, clusters will appear with a 
greater contrast at the most relevant frequencies for cluster 



detection. In real life the clusters will be dimmer due to this 

dilution effect. In this section wc will include the relativistic 
correction of the SZE in our simulation as described in ta- 
ble 1, and we will compare with the case where the correction 
is not considered in the simulation. The cluster distribution 
in the M — z space is obtained from Press-Schechter (Press 
& Schechter 1974) for a flat ACDM universe with = 0.3 
and SIa ~ 0.7. The temperature of the clusters is obtained 
from the T — M relation. For this relation we have used the 
fitting formula found by (Diego et al. 2001) where the au- 
thors fitted the T — M relation to several X-ray data. The 
electron density is assumed to follow a standard ,5— model 
(/3 = 2/3), and we use as = 0.8 (for more details see (Diego 
et al. 2002)). Once we have the temperature of the cluster, 
the relativistic correction is computed for each cluster and 
in each one of the Planck channels. In the simulated area 
of the sky (12.8° x 12.8°), there are about 20000 clusters 
in the simulation above a mass of 3 X 10"/i"^Mo which 
corresponds to a temperature w 1 keV. From those clusters 
only a few tens will be detected by Planck in this area of the 
sky. The hottest cluster in this simulation has a temperature 
T = 15 keV. The most massive cluster in the simulation has 
M = 1.1 X lO^^h-^MQ, and it is at a = 0.66. 

In Figure 2 we present this simulated SZE map com- 
pared with the SZE map recovered by the non-parametric 
method (assuming the non-relativistic form for f{x) in the 
component separation process) and the difference between 
them (the residual) in the 353 GHz channel. We do not 
present here the case where the relativistic correction is not 
used in the simulation because it looks very similar to Fig- 
ure 2 (see (Diego et al. 2002)). The recovered map contains 
positive and negative values. This is due to the wrong choice 
of SZE prior in the non-parametric method. When comput- 
ing the residual (true SZE map minus recovered SZE map), 
the negative values in the recovered map must be set to 
since the Compton parameter must contain only positive 
values. 

In the next section we will compare the recovered SZE 
map (Figure 2 bottom left) in the two cases, SZE simula- 
tion including the relativistic correction and SZE sinmlation 
not including the relativistic correction. In section 5 we will 
concentrate on the residual (Figure 2 right laxge panel) in 
the case with relativistic correction and consider the effect 
on the CMB. 

4 PLANCK CLUSTER CATALOGUE 
4.1 Thermal SZ 

The recovered map (Figure 2 bottom- left) is a noisy estimate 
of the real map (top- left). This noisy map is composed basi- 
cally of two components. A Gaussian background which con- 
tains most of the spurious signal in the recovered map and 
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Figure 3. Each dot in this plot represents a cluster in our simu- 
lation (only clusters with fluxes bigger than 60 mjy are plotted). 
The solid line represent the selection function at 60 mJy. Dots sur- 
rounded by a square are the clusters detected using the Bayesian 
approach. The 4 clusters marked with a big star are the clusters 
which are not detected by SEXTRACTOR when the relativis- 
tic corrections are included in the SZE simulations. The insert 
plot shows the underlying distribution of clusters for this model 
(dN/dz). 



a positive tail (in the pdf) which contains the galaxy clus- 
ters. In order to discriminate between the noisy background 
and the clusters we have used the package SEXTRACTOR 
(Bertin & Arnouts 1996). With SEXTRACTOR we can de- 
tect 46 clusters (above a threshold of 3<7 and with 10 pixels 
connected). We have compared this number of detections 
with the ones obtained when the original SZE map is simu- 
lated without considering the relativistic correction. In this 
case, the assumption made in the non-parametric method 
about the non-contribution of the corrections to the SZE 
frequency dependence is correct and there is no systematic 
error introduced by the relativistic corrections. 

When the simulated SZE does not include relativistic 
corrections the number of detections returned by SEXTRAC- 
TOR (with the same criteria as above) is 50. In Figure 3 we 
show the recovered clusters in both cases (SZ simulation 
with and without relativistic corrections) . We also show the 
selection functions at the flux 60 mJy. Above 200 mJy (at 
353 GHz) the method detects almost 100 % of the clusters. 
No clusters are detected below the flux 60 mJy. There are 4 
missing clusters which are not detected when the SZE sim- 
ulation includes the relativistic corrections (big stars). This 
is just an example of the dilution effect of the frequency de- 
pendence when the relativistic corrections are included. The 
hot clusters become less bright in the main Planck channels 
when the relativistic correction is included and it is there- 
fore more difficult to detect them. It is important to note 
that these missing clusters are more concentrated around 
intermediate-high redshifts {z « 0.2 — 0.8). The dilution ef- 
fect is more important for the hottest clusters which, for the 
same mass, are expected to be in the high redshift interval 
(T oc M^/^(l -I- z) in the Clm = 1 case (note that in the sim- 
ulations we use the more general formulae with Qm = 0.3 
and = 0.7)). 

The fact that the missing clusters are close to the lim- 
iting flux of the survey makes the relativistic correction an 
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Figure 4. Recovered fluxes versus true fluxes at 353 GHz. Crosses 
are the fluxes when the relativistic corrections are taJcen into ac- 
count in the simulated SZE map (but not in the Bayesian ap- 
proach) . Asterisks indicate the recovered fluxes when both in the 
simulation and in the Bayesian approach, the relativistic correc- 
tions are not included. 



important issue to be taken into account in future modeling 
of the data. The future catalogue of clusters obtained by 
Planck could be used as an independent cosmological test, 
however, in order to do that it is crucial to understand the 
selection function and completeness level of the catalogue. 
Our results siiow that an accurate estimation of both selec- 
tion function and completeness obtained from simulations 
should include the relativistic corrections in the simulations 
in order to not underestimate these quantities at intermedi- 
ate/high redshifts. 

The relativistic corrections wiU not only have an effect 
on the number of clusters which can be detected but also in 
the accuracy in the estimation of their fluxes. In Figure 4 we 
show how well SEXTRACTOR recovers the total flux of the 
cluster in both cases (with and without the relativistic cor- 
rections in the simulated SZE map). The difference between 
considering and not considering the relativistic correction in 
the SZE sinmlated map is small in the computation of the 
total flux. 



4.2 Kinematic SZ 

The previous discussion was for the thermal component of 
the SZE, but the kinematic component will also be affected. 
In this case the distortion in the CMB temperature is given 

by 

AT _ _Vr_ 

T Kin C 

where Vr is the radial (peculiar) velocity, and r the optical 
depth of the cluster. The kinematic SZE component has the 
same frequency dependence as the CMB fluctuations. Fur- 
thermore, its intensity is typically 20-30 times smaller than 



(7) 
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the thermal SZE component. This makes it extremely diffi- 
cult to measure the kinematic component. One of the best 
strategies to detect it is to apply an optimal filter for cluster 
detection (see Herranz et al. 2002) to the 217 GHz channel 
where the thermal SZE component vanishes (in the non- 
relativistic approach) and then cross-correlate the filtered 
map with the positions where the thermal SZE component 
was detected. However, when we consider the relativistic 
corrections, we cannot simply assume that the thermal SZE 
component vanishes at 217 GHz (the cross-over ^ is at « 225 
GHz for T = 15 keV). Instead, there will be a contribution of 
the thermal SZE at this frequency which can be of the same 
order of magnitude of the kinematic SZE. In order to quan- 
tify how important the contamination of the thermal SZE to 
the kinematic component is in the 217 GHz channel, we have 
compared a simulated SZE map of the kinematic effect with 
the corresponding thermal component (including relativis- 
tic corrections) at this frequency. The kinematic component 
has been simulated assuming a Gaussian distribution for the 
radial velocities with a dispersion of = F{z) * 400 km/s, 
where F{z) accounts for the evolution of the velocity field 
with redshift in linear perturbation theory, (Peebles 1980) 

The rest of the parameters arc the same as in the simulation 
of the thermal component. In Fig. 5 we present the relative 
error which, in each pixel, has been defined as 

At 



Err = 100 x 



(9) 



where At is AT/T for the thermal component and Ak 
for the kinetic one. This relative error is w in the non- 
relativistic approach. Since the radial velocity of the clusters 
can be arbitrarily small, we have to set a threshold in the 
calculation, which is AT/T = 10~^ for the kinematic effect, 
and we only compute the relative error for clusters above 
this threshold. As can be seen in figure 5, the relative error 
can be as large as 100% for some clusters and there are sev- 
eral clusters with relative errors ~ 50%. This will constitute 
a serious problem for the correct estimation of the kinematic 
SZE component in those clusters, and thus making it harder 
to constrain theories of structure formation and evolution 
from measurements of the radial peculiar velocities (see e.g. 
(Aghanim, Gorski & Puget 2001)). Since the relative error 
is proportional to the thermal SZE component which in- 
cludes the relativistic corrections, and since the relativistic 
corrections are more important for the hottest clusters, the 
relative error is (in general) larger for the hottest clusters 
and smaller for the coldest ones. 

In our calculations, we did not consider the bandwidth 
for Planck in the 217 GHz channel. However, our conclusions 
are correct if the bandwidth is symmetric around the central 
frequency since in this case the effect of integrating along the 
bandwidth is the equivalent to taking the central frequency, 
because the frequency dependence of the SZE can be well 
approached by a straight line over the bandwidth at 217 
GHz. 



t The cross-over frequency goes like /o = 217.5(1 -t- 2.27 x 
10-3 T - 3.12 X 10"^ T2) for vanishing optical depth, t Ri 0. 




Relative Error (%) 

Figure 5. The histogram shows the number of pixels above 
threshold (see text) as a function of their relative error in the 
kinematic component due to the non-subtracted thermal compo- 
nent. We have chosen a broad binning (20 bins) in order to have 
a significant number of points in the tails of the distribution. 



5 THE CMB MAP 

The SZE residual in Figure 2 will contaminate the other 
components, and in particular it will contaminate the most 
important component (from a cosmological point of view), 
the CMB. The percentage of the SZE residual which con- 
taminates the CMB will depend on the method used to per- 
form the component separation. There arc two extreme cases 
which can be considered. One in which no SZE residual con- 
tributes to the CMB map and the other case in which 100 
% of the SZE residual contaminates the CMB. The real sit- 
uation will be between the two extreme cases, and only part 
of the SZE residual will contaminate the CMB map. 

There is one basic constraint in all the component sep- 
aration algorithms which must be obeyed. The sum of all 
the components (in each pixel) must be equal to the data. 
Since most of the component separation algorithms work 
in Fourier space, this constraint must be reformulated in 
terms of the Fourier coefficients. The SZE residual is basi- 
cally contributing at small scales. This means, that at these 
scales we have not subtracted all the SZE signal. Due to the 
previous constraint, this residual will go to some of the other 
components and will contaminate the Fourier modes at the 
small scales. To understand which components are most af- 
fected by the SZE residual, we have to understand the way 
the different component separation methods work. In typi- 
cal component separation methods (and in our method too), 
the point sources are first subtracted using an optimal filter 
(or an equivalent one). Since the SZE has not been removed 
yet, no SZE residual can go to the point source component. 
Then the rest of the components are separated assuming 
some frequency dependence for all of them and a correlar 
tion matrix between the components (in our method we do 
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not need to make these assumptions). This cannot be done 
in the case of point sources since they do not have the same 
frequency dependence. This is the reason why they have to 
be subtracted first. It is usually assumed in the correlation 
matrix that the galactic components (synchrotron, free-free 
and dust) have much more power at large scales than at 
small scales. In our method we do not make this assump- 
tion. Instead the dust is partially removed by subtracting 
the 857 GHz channel times a constant which minimizes the 
variance of the difference, however, the situation is similar 
to that described above, i.e, the dust has much more power 
at large scales than at the small ones and when we remove 
the dust, wc arc basically removing a diffuse component at 
large scales while the small scales do not change substan- 
tially. This means that the power at small scales is basically 
due (after point source subtraction) , to the contributions of 
the SZE and the CMB (at even smaller scales the instru- 
mental noise dominates over the rest of the components). 
We have thus seen that the major contaminant of the CMB 
at small scales is the SZE. If we do not remove perfectly the 
SZE component at small scales, its residual will contaminate 
the CMB modes at those scales. 

In the previous discussion, we have assumed that the 
removal of the point sources is perfect. In reality the point 
source subtraction produces another residual which will con- 
tribute to small scales. This residual will have an effect on 

both the CMB and the SZE, however, the effect on the SZE 
component is small since point sources have a quite different 
frequency dependence from the SZE. Therefore, the point 
source residual will basically affect the CMB at small scales. 
However, this point is beyond the scope of this paper and 
we will concentrate only on the SZE residual. 

Now the question is how important is the non- 
subtracted SZE signal (the residual) for the CMB science. 
The basic quantity in cosmological studies based on the 
CMB is the power spectrum. In order to check the relevance 
of this residual in the CMB we have computed that quan- 
tity in the two extreme cases. Where the CMB map does not 
have any residual and when the CMB map includes all the 
SZE residual. We also compute these quantities in the chan- 
nel at 353 GHz where the SZE residual is the largest (the 
channels at 545 and 857 GHz cannot be used to obtain the 
CMB power spectrum). By taking these two extreme cases 
we can set an upper limit on the effect of the SZE residual. 
The result is shown in Figure 6. As can be seen from that 
figure, even in the extreme case where all the SZE residual 
is contained in the CMB component, the power spectrum 
is not very much affected by the SZE residual. These are 
good news since one docs not have to worry much about 
the contribution of the SZE residual to the power spectrum. 
Only at very small scales, the distortion in the power spec- 
trum due to the residual can be important but those high-fe 
modes will be dominated by the error bars. These error bars 
include instrumental noise, cosmic variance, and the error 
associated with the component separation process. If one 
takes into account all the previous error bars one finds that 
Planck will be able to estimate the power spectrum of the 
CMB up to fc « 70 {I « 2000). At those scales, the contri- 
bution of the SZE residual to the power spectrum is almost 
negligible. 




1 10 100 

k 



Figure 6. Power spectrum of the CMB map (dotted line) and 
power spectrum of the sum CMB plus SZE residual (solid line). 
The effect of the residual is small in the power spectrum. 



5.1 Non-Gaussianity 

There are, however, other studies which can be carried out 
with CMB data apart from its power spectrum. Gaussianity 
studies are very important since Gaussianity of the CMB is 
a basic prediction of single field inflationary models. As an 
alternative to inflationary models there are e.g. models in 
which the structure originates from topological defects, and 
such models predict a non-Gaussian pattern for the CMB. 
When mixed adiabatic and isocurvature models are com- 
pared with current CMB power spectrum, one flnds that 
maximum 15% isocurvature is allowed by the data (Enqvist 
et al. 2000), while pure adiabatic models give a very good 
fit. For Gaussianity studies one should be extremely care- 
ful when dealing with CMB maps. Even if the real CMB 
map is Gaussian, the map will contain some residuals which 
can be non-Gaussian. In the case of the SZE the residual is 
clearly non-Gaussian. It is, therefore, important to see if the 
non-Gaussian signal due to the SZE residual is relevant or 
not. 

The power spectrum of the CMB is a good indicator 
of the mean contribution of the signal at different scales. 
However, the power spectrum is not an estimator of the 
Gaussianity of a map, and one has to use other estimators. 
If we look at the residual in Figure 2 we see that the main 
contribution of the residual is localized in compact peaks. If 
the CMB map contains this residual, then the peaks of the 
residual will be mixed with the intrinsic peaks of the CMB. 
A good Gaussian estimator in this case should concentrate 
on the small regions where the compact peaks of the residual 
are more likely to contribute. We propose the use of the 
mexican hat wavelet (MHW) although other filters could be 
used instead. The MHW in real space is given by the second 
derivative of a Gaussian; 

$(r) = (2.0-(^)VK-^) (10) 

where we will refer to s as the scale of the MHW. In Fig- 
ure 7 we show the particular case of s = 2' in a grid of size 
20' X 20'. Although the scale used to build this MHW was 
only 2 arcmin, the MHW extends up to several arcmin (be- 
yond the antenna FWHM of the 353 GHz channel). In a few 
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Figure 7. MHW in real space. The size of the grid over which 
the MHW is defined in this plot is 30' x 30'. Although the scale, 
s, of this MHW is 3 arcmin, then the total size of the MHW is 
Ri 20 arcmin. 



words, the effect of the MHW is to magnify, with respect to 
the background, the signals with scales around the scale of 
the MHW. The idea is to look at the number of wavelet coef- 
ficients above a certain level (threshold) at different scales in 
the map of CMB plus SZE residual (plus noise) and compare 
these coefficients with the ones obtained when only the CMB 
(plus noise) is considered. We show the result for the channel 
at 353 GHZ in Figure 8, where we have changed the scale 
just to see how the contribution of the SZE residual changes 
with the scale of the MHW. In our simulations, wc have 
also included the expected Planck noise level in this chan- 
nel. The instrumental noise is simulated as a white Gaussian 
noise with an RMS of 14.4 x 10~^ per resolution element in 
AT/T units. The result is compared with the mean value 
and la error bars obtained from 100 Gaussian realizations 
of the CMB (plus instrumental noise). The mean value of 
these realizations is consistent with the expected number of 
wavelet coefficients (in absolute value) above the threshold 
for a map with 512^ MHW coefficients. This expected num- 
ber is just 0.0465% (for a threshold — 3.5a) of 512^ which is 
fa 120. This number is a constant independent of the scale 
of the MHW since the convolution of the CMB maps with 
the MHW does not change their Gaussian nature. Since the 
original map contains 512^^ pixels, the convolved map will 
contain 512^ MHW coefficients and the expected number 
of MHW coefficients (in absolute value) above 3.5cr for a 
Gaussian map will remain constant at all scales, « 120. The 
reason why wo choose a threshold of 3.5(t is because wc have 
to find a compromise between having enough statistics and 
the significance of the result. A lower threshold will produce 
a larger number of coefficients above the threshold but these 
number will be more dominated by the Gaussian part of the 
distribution. A higher threshold will select the tail of the 
Gaussian distribution which will show more clearly the non- 
Gaussian contribution due to the SZE residual. However, a 
high threshold will have a low expected number of coeffi- 
cients above the threshold in a Gaussian case. We choose 
the threshold at 3.5a because for this threshold, both the 
mean number of Gaussian coefficients and the significance 
are high enough (120 and 3.5cr respectively). As it can be 
seen in figure 8, the CMB plus SZE residual shows a clear 
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Figure 8. Number of MHW coefficients above 3.5a level as a 
function of the scale of the MHW for two Planck channels, 353 
GHz (left) and 143 GHz (right). The solid line is the number of 
coefficients (in absolute value) above the threshold for the Gaus- 
sian CMB map plus the SZE residual plus noise. For reference, 
we also show the case when the SZE is not removed at all (dotted 
line). The big black dots arc the mean value of 100 Gaussian real- 
izations of the CMB plus the corresponding noise at that channel 
and the error bars are Icr. The mean value is consistent with 
the expected number of points above 3.5a for a map with 512^ 
wavelet coefficients (in absolute value). This plot clearly shows, 
how the residual adds a significant (> 4a) non-Gaussian signa- 
ture at small scales in the 353 GHz channel. In the case of the 143 
GHz chajinel, the SZE residual does not introduce a significant 
non-Gaussian signature. In the rest of the channels, the number 
of coefficients above 3.5it are also consistent with the Gaussian 



deviation from the Gaussian case which is larger when the 
scale is smaller. The deviation is maximum at scales of the 
MHW around 3 arcmin (4.25cr at 2.8 arcmin). For the rest of 
the channels, the non-Gaussian signature of the CMB plus 
SZE residual is consistent (within the corresponding error 
bars) with the Gaussian case (see the 143 GHz case). This 
is due to two things. First, the amplitude of the SZE residual 
is smaller in the other channels. Second, and more impor- 
tant, the antenna is larger and the signal is diluted so the 
small scales cannot be magnified with the same efficiency 
as in the channel at 353 GHz. For comparison we show the 
result when our Gaussianity estimator is applied to the 143 
GHz channel. In this case, no significant non-Gaussianity is 
observed at small scales. In the rest of the channels, the an- 
tenna is even larger so the dilution of the residual is bigger. 
The main conclusion is that the 353 GHz channel can con- 
tain a significant non-Gaussian signal at small scales coming 
for the SZE residual. It is worth pointing out, that MAP will 
not measure non-Gaussianity from SZE residuals mainly be- 
cause of the lower resolution, ^ 12 arcmin at 90 GHz (i.e. 
the situation will be similar to what we observe in the chan- 
nels below 143 GHz). 

It is also worth pointing out that such non-Gaussianity stud- 
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Figure 9. Histograms of tlie 3 arcmin MHW coefficient maps. 
The dotted line is the case when only the CMB plus noise is 
considered (no residual). The solid line shows the excess when 
the the SZE residual is added to the CMB plus noise map (353 
GHz channel). 

ies could be used to determine which component separation 
methods are most useful for Planck. In future works focus- 
ing on non-Gaussianity studies, one could imagine using the 
353 GHz channel to remove the non-Gaussianity due to SZE 
residuals from the other channels. Another interesting pos- 
sibility is to use this non-Gaussianity to remove even more 
of the SZE residual. 

In Figure 9 wo show the histogram of the two wavelet 
coefficient maps (CMB plus noise and CMB plus noise plus 
SZE residual) for the case when the maps are filtered with 
a MHW with a scale of 3 arcmin. This plot shows again the 
non-Gaussian nature of the CMB plus SZE residual map at 
small scales. 

We have also applied our non-Gaussianity estimator to see 
if we can detect some non-Gaussianity coming from the con- 
tribution of the kinematic SZE. We did not find any non- 
Gaussian signatures due to this component. 



6 CONCLUSIONS 

In this paper we have studied the systematic errors intro- 
duced in the Planck cluster catalogue and in the CMB map 
due to the imperfect SZE subtraction caused by the non- 
relativistic assumption and to the intrinsic error in the com- 
ponent separation process. Wo have used the non-parametric 
method proposed in (Diego ct al. 2002) to perform the com- 
ponent separation method. This method is very robust in 
the sense that it makes a minimum number of assumptions. 
The drawback is that it is not the most precise in the deter- 
mination of the SZE component. However, this allow us to 
put an upper limit on the systematic errors introduced by 
the imperfect SZE subtraction since any more sophisticated 
method in principle should produce smaller SZE residuals. 



We have seen that the effect of the non-relativistic as- 
sumption in the frequency dependence of the SZE made in 
the component separation process is small when compared 
to the case where the real frequency dependence is assumed 
(between 4% and 8% relative difference for T = 10 keV 
clusters), however, the relativistic corrections should be con- 
sidered in the simulations of the SZE in order to compute 
correctly the selection function and completeness level of the 
cluster catalogue. 

Concerning the kinematic component of the SZE, rela- 
tivistic corrections should be taken into account in order 
to recover this component. Otherwise errors as large as 
50 — 100% could be introduced in this component in the 
most relevant channel for its detection (217 GHz). 

In the CMB, the SZE residual in the 353 GHz leaves a 
non- Gaussian signature at small scales which could be de- 
tected by some Gaussianity estimators like the MHW. This 
channel could thus be used to extract non-Gaussianity signa- 
tures from imperfect SZE subtraction from the other chan- 
nels. Our MHW Gaussianity estimator does not show sig- 
nificant deviations from Gaussianity due to imperfect SZE 
recovery in the other channels relevant to the CMB. 

The SZE residual does not change significantly the 
power spectrum of the CMB in the 353 GHz channel. It 
is therefore safe to include the 353 GHz channel for the 
computation of the CMB power spectrum. This channel, 
together with the 217 GHz channel, are the most important 
ones which will contribute to the CMB map at the smallest 
scales (« 5 arcmin). 
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